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Abstract 

We consider infinite sequences of symbols, also known as streams, and 
the decidability question for equality of streams defined in a restricted 
format. This restricted format consists of prefixing a symbol at the head 
of a stream, of the stream function 'zip', and recursion variables. Here 'zip' 
interleaves the elements of two streams in alternating order, starting with 
the first stream; e.g., for the streams defined by { zeros = : zeros, ones = 
1 : ones, alt = : 1 : alt} we have zip(zeros, ones) = alt. The celebrated 
Thue-Morse sequence is obtained by the succinct 'zip-specification' 

M = 0:X X=l:zip(X,Y) Y = 0:zip(Y,X) 

Our analysis of such systems employs both term rewriting and coal- 
gebraic techniques. We establish decidability for these zip-specifications, 
employing bisimilarity of observation graphs based on a suitably chosen 
cobasis. The importance of zip-specifications resides in their intimate con- 
nection with automatic sequences. The analysis leading to the decidability 
proof of the 'infinite word problem' for zip-specifications, yields a new and 
simple characterization of automatic sequences. Thus we obtain for the 
binary zip that a stream is 2-automatic iff its observation graph using the 
cobasis (hd, even, odd) is finite. Here odd and even have the usual recur- 
sive definition: even (a : s) = a : odd(s), and odd (a : s) = even(s). The 
generalization to zip-A: specifications and their relation to fc-automaticity 
is straightforward. In fact, zip-specifications can be perceived as a term 
rewriting syntax for automatic sequences. Our study of zip-specifications 
is placed in an even wider perspective by employing the observation graphs 
in a dynamic logic setting, leading to an alternative characterization of 
automatic sequences. 

We further obtain a natural extension of the class of automatic se- 
quences, obtained by 'zip-mix' specifications that use zips of different 
arities in one specification (recursion system). The corresponding notion 
of automaton employs a state-dependent input-alphabet, with a number 
representation (ti)a = d m ... do where the base of digit di is determined 
by the automaton A on input di-i . . . do- 

We also show that equivalence is undecidable for a simple extension 
of the zip-mix format with projections like even and odd. However, it 
remains open whether zip-mix specifications have a decidable equivalence 
problem. 



1 Introduction 



Infinite sequences of symbols, also called 'streams', are a playground of common 
interest for logic, computer science (functional programming, formal languages, 
combinatorics on infinite words), mathematics (numerations and number the- 
ory, fractals) and physics (signal processing) . For logic and theoretical computer 
science this interest focuses in particular on unique solvability of systems of re- 
cursion equations defining streams, on expressivity (what scope does a definition 
or specification format have), and productivity (does a stream specification in- 
deed unfold to its intended infinite result without stagnation). In addition, 
there is the 'infinitary word problem': when do two stream specifications over a 
first-order signature define the same stream? And, is that question decidable? 
If not, what is the logical complexity? 

Against this background, we can now situate our present paper. In the 
landscape of streams there are some well-known families, with automatic se- 
quences [2] as a prominent family, including members such as the Thue-Morse 
sequence [1]. Such sequences are defined in first-order signature that includes 
some basic stream functions such as hd (head), tl (tail), ':' (prefixing a sym- 
bol to an infinite stream), even, odd; all these are familiar from any functional 
programming language. 

One stream function in particular is frequently used in stream specifications. 
This is the zip function, that 'zips' the elements of two streams in alternating 
order, starting with the first stream. Now there is an elegant definition of 
the Thue-Morse sequence M using only this function zip, next to prefixing an 
element, and of course recursion variables: 

M=0:X X = l:zip(X,Y) Y = 0:zip(Y,X) (1) 

For general term rewrite systems, stream equality is easily seen to be un- 
decidable [17], just as most interesting properties of streams. But by adopting 
some restrictions in the definitional format, decidability may hold. 

Thus we consider the problem whether definitions like the one of M, using 
only zip next to prefixing and recursion, are still within the realm of decidability. 
Answering this question positively turned out to be rewarding. In addition to 
solving the technical problem, the analysis leading to the solution had a useful 
surprise in petto: it entailed a new and simple characterization of the important 
notion of fe-automaticity of streams. (The same 'aha-insight' was independently 
obtained by Kupke and Rutten, preliminary reported in [14].) 

The remainder of the paper is devoted to an elaboration of several aspects 
concerning zip-specifications and automaticity. First, we treat a representation 
of automatic sequences in a framework of propositional dynamic logic, employ- 
ing cobases and the ensuing observation graphs (used before for the decidability 
of equivalence) as the underlying semantics for a dynamic logic formula charac- 
terizing the automaticity of a stream. Second, we are led to a natural general- 
ization of automatic sequences, corresponding to mixed zip-specifications that 
contain zip operators of different arities. The corresponding type of automaton 
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Figure 1: Observation graph for the specification (1) 
of the Thue-Morse sequence M. 

employs a state-dependent alphabet. Third, we show that stream equality for 
a slight extension of zip-specifications is 11°; the latter via a reduction from the 
halting problem of Fractran programs [7] . 

Let us now describe somewhat informally the key method that we employ to 
solve the equivalence problem for zip-specifications. To that end, consider the 
specification (1) above with root variable M. This specification is productive [19, 
8, 10] and defines the Thue-Morse sequence: 

M ->•" 0:1:1:0:1:0:0:1:1:0:0:1:0:1:1:0:..., 

that is, by repeatedly applying rewrite rules that arise by orienting the equations 
for M, X and Y from left to right, M rewrites in the limit to the Thue-Morse 
sequence [1]. 

We will construct so-called 'observation graphs' based on the stream cobasis 
(hd, even, odd) where all nodes have a double label: inside, a term corresponding 
to a stream (such as M and : X in Figure 1) and outside, the head of that 
stream. The nodes have outgoing edges to their even- and odd-derivatives. An 
example is shown in Figure 1. 

So, the problem of equivalence of zip-specifications reduces to the problem 
of bisimilarity of their observation graphs, which we prove to be finite. This 
does not hold for observation graphs of zip-specifications with respect to the 
cobasis {hd, tl): for this cobasis, the above specification would yield an infinite 
observation graph. (The same would hold for any stream which is not eventually 
periodic.) 

The observation graph in Figure 1 evokes the 'aha-insight' mentioned above: 
it can be recognized as a DFAO 1 (deterministic finite automaton with output) 
that witnesses the fact that M is a 2-automatic sequence [2]. 

We will exhibit the close connection between zip-specifications and automatic 
sequences, residing in the coincidence of DFAOs and observation graphs. 



2 Zip-Specifications 

For term rewriting notions see further [21]. For k E N wc define N<fe = 
{0, 1, . . . , k — 1}. Let A be a finite alphabet of at least two symbols, and X 
a finite set of recursion variables. 

lr The bisimulation collapse of the graph in Fig. 1 identifies the states labeled M and : X, 
giving rise to the familiar (minimal) DFAO for M. 
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Definition 1. The set A" of streams over A is defined by A w = {a \ a : N — > 
A}. 

We write a : a for the stream r defined by r(0) = a and r(n + 1) = a(n) 
for all neN. We define hd : A w ->■ A and tl : A w ->■ A" by hd(x : a) = z and 
t\(x : a) = a. 

Convention 2. We usually mix notations for syntax (term rewriting) and se- 
mantics ('real' functions). Whenever confusion is possible, we use fonts fun, and 
fun to distinguish between functions, and term rewrite symbols, respectively. 

Definition 3. For k e N >0 , the function zip k : (A UJ ) k -> A w is defined by the 
following rewrite rule: 

z\p k (x : ct ,cti, . . . ,<7 fe _i) -> x : zip fe (ai, . . .,a k -i,a ) 

Thus zip k interleaves its argument streams: 

zip k (a a , . . .,a k -i)(kn + i) = a^ri) (0 < i < k) 

Definition 4. The set 2(A, X) of zip-terms over (A, X) is defined by the 
grammar: 

Z ::=X\a:Z\z\p k (Z 11 ^ 1 Z) (X e X, a e A, k e N) 

times 

A zip-specification S over (A, X) consists of a distinguished variable Xq e 
called the root of S, and for every X <G X a pair (X, t) with t 6 Z(A, X) a zip- 
term. We treat these pairs are term rewrite rules, and write them as equations 
X = t. 

Definition 5. For k G N, the set Z k (A, X) of zip-k terms is the restriction of 
Z(A, X) to terms where for every occurrence of a symbol zip £ (£ <G N) it holds 
that i = k. 

A zip-k specification is a zip-specification such that for all equations X = t 
it holds that t e Z k (A, X). 

We always assume for zip-specifications S that every recursion variable is 
reachable from the root Xq . 

2.1 Unique Solvability, Productivity and Leftmost Cycles 

Definition 6. A valuation is a mapping a : X — > A". Such a valuation a 
extends to [-J Q : Z(A,X) A w as follows: 

[X] Q = a(X) 
[a : t} a = a : [t] a 
[zip fc (ti,...,i fe )] a = zip kdhja, . . . , p fc ] a ) 
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A solution for a zip-specification S is a valuation a : X — > A", denoted a \= S, 
such that [[X] a = [t] Q for all X = t € S. 

A zip-specification S is uniquely solvable if there is a unique solution a for 
S; then we let [-J 5 = a denote this solution. 

Definition 7. Let <S and S' be zip-specifications with roots X and X , respec- 
tively. Then S is called equivalent to S' if they have the same set of solutions 
for their roots: 

{lX j a \a^S} = { [X ] Q , I h } 

Definition 8. A zip-specification 5 with root Xo is productive if there exists a 
reduction of the form Xo — >* ai : . . . : a n : t for all neN. If a zip-specification 
S is productive, then S is said to define the stream [XoJ 5 where Xo is the root 
of S. 

Note that if a specification is productive, then by confluence of orthogo- 
nal term rewrite systems [21], there exists a rewrite sequence of length u> that 
converges towards an infinite stream term a\ : a 2 : 03 : . . . in the limit. 

While productivity is undecidable [9, 20] for term rewrite systems in general, 
zip-specifications fall into the class of 'pure stream specifications' [8, 10] for 
which (automated) decision procedures exist. However, the latter would be 
taking a sledgehammer to crack a nut. For zip-specifications, productivity boils 
down to a simple syntactic criterion. 

Definition 9. Let S be a zip-specification. A step in S is pair of terms (s, t), de- 
noted by s ~> t, such that (a) s — > t e S, (b) s = a : t, or (c) s = zip fc (. . . , t, . . .). 
A guard is a step of form (b) . A left-step s t in S is a step s ~> t of the form 
(a), (b) or (c') s = zip fe (t,...). 

A cyc/e m <S is a sequence ti,t 2 , ■ ■ ■ ,t n such that t\ = t n £ X and ij ~> 
for 1 < i < n. A leftmost cycle in S is a cycle ii, <2, . . . , t n such that tj ^£ t i+ i 
for 1 < i < n. 

Example 10. Consider the following specification 

X 
Y 
Z 

visualized as the cyclic 
most cycle Y ^1 zip(7 
not guarded. 

For term rewriting systems in general, productivity implies the uniqueness 
of solutions, but unique solvability is not sufficient for productivity. For zip- 
specifications it turns out that both concepts coincide. Here we need that A is 
not a singleton — otherwise every specification has a unique solution. 

Theorem 11. For zip- specifications S these are equivalent: 



= zip(l:X,Y) 
= zip(Z,X) 
= zip(Y,0:Z) 



; term graph on the right. The left- 
I,X) Z^t zip(Y,0:Z) ^ e Y is 
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(i) S is uniquely solvable, 
(ii) S is productive. 

(Hi) S has a guard on every leftmost cycle. 

2.2 Evolving and Solving Zip-Specifications 

The key to the proof of Theorem 1 1 consists of a transformation of zip-specific- 
ations by (i) simple equational logic steps, and (ii) internal rewrite steps. 

Definition 12. For zip-specifications S,S' we say S evolves to 5', denoted by 
SOS', if one of the conditions holds: 

(i) S contains an equation X = a : t with X ^ X and S' is obtained from S 
by: let X' be fresh and 

(a) exchange the equation X = a : t for X' = t, then 

(b) replace all X in all right-hand sides by a : X', and 

(c) finally rename X' to X (X is no longer used). 

(ii) S contains an equation X = t such that t rewrites to t' via a zip-rule 
(Definition 3), and S' is obtained from S by replacing the equation X = t 
with X = t'. 

The condition X ^ X in clause (i) guarantees that the meaning (its solution) 
is preserved under evolving. It prevents transforming a specification like Xo = 
: 1 : Xo into Xo = 1 : : Xo which clearly has a different solution. 

Lemma 13. Let SOS'. Then for every a : X — > A" it holds that a is a 
solution of S if and only if a is a solution of S' . Moreover, if S is productive 
then so is S' . 

Definition 14. A zip-specification S is said to have a free root if the root X 
of S docs not occur in any right-hand side of S. 

Lemma 15. Every zip- specification can be transformed into an equivalent one 
with free root. 

The following lemma relates rewriting to evolving: 

Lemma 16. Let S be a zip- specification with free rootXo. There exists a reduc- 
tion Xo a\ : . . . : a n : t in S if and only if there exists a zip-specification S' 
such that S Cj* S' and S' contains an equation of the form X = a\ : . . . : a n : t' . 
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Example 17. We evolve the following specification: 

X = zip(l:X,Y) Y = Oitl(zip(Z,X)) Z = zip(Y,0:Z) 
X = zip(l:X,07Y) Y = tl(zip(Z,X)) Z = zip((T[Y, : Z) 
... Y = tl(zip(Z,X)) Z=Ezip(0:Z,Y) 
... Y = tl(zip(OlZ,X)) Z = zip(0:tJ7Z,Y) 
... Y = tl(07zip(X, Z)) Z = zip(0:0:Z,Y) 
X = zip(l:X,0:Y) Y = zip(X,Z) Z = zip(0 : : Z, Y) 

Note that the contracted redexes are underlined and the created symbols are 
overlined. Also note that invoking a free root is not needed for the evolution 
above. 

Strictly speaking, the last step in the above example is not covered by Def- 
inition 12 since the rule for 'tl' is not included. We have chosen this example 
to demonstrate another principle. The specification we started from is obtained 
from Example 10 by inserting : tl(. . .) on an unguarded leftmost cycle. Evolv- 
ing has resulted in a productive zip-specification (now every leftmost cycle is 
guarded) that represents a solution of the original specification. Similarly, by 
inserting 1 : tl(. . .), we obtain the solution: 

X = zip(l:X,l:Y) Y = zip(X,Z) Z = zip(0 : 1 : Z, Y) 

The insertion of : tl(. . .) and 1 : tl(. . .) corresponds to choosing whether we are 
interested in a solution for Y starting with head or 1. To see that the result 
of the insertions are valid solutions it is crucial to observe that the symbol 'tl' 
in the inserted a : tl(. . .) disappears by consuming a 'descendant' of the element 
a E A. In general we have: 

Lemma 18. Let S be a zip-specification. Define the set {Yi, . . . , Y m } to contain 
precisely one recursion variable from every unguarded leftmost cycle from S. 

Let a — (ai, . . . , a m ) E A" 1 and define S3 to be obtained from S by replacing 
each equation Y, = ti by Y, = Oj : tl(ij). Subsequently, we can by the evolving 
procedure eliminate the occurrences of the symbol tl as in Example 17. Then 
S3 is productive, and the unique solution [-J 52 : X — > A" is a solution of S. 
Hence, {S3 | a E A™ 1 } is the set of all solutions of S, in particular, S has \ A\ m 
different solutions. 

2.3 Formats of Zip-Specifications 

Definition 19. A zip-specification S is called flat if each of its equations is of 
the form: 

X 4 = c,,i : . . . : c^ mi : zip^Xj,!, . . . , X^fcJ (0 < i < n) 

for mi, ki EN, ki > 2, recursion variables X^X^i, . . . ,Xj j / Si and data constants 
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Zip-free cycles correspond to periodic sequences, and these can be specified 
by flat zip-fc specifications. Together with unfolding and introduction of fresh 
variables we then obtain: 

Lemma 20. Every productive zip-k specification can be transformed into an 
equivalent productive, flat zip-k specification. 

3 Zip-Specifications and Observation Graphs 

For the decidability result and the connection with automaticity we need to 
observe streams and compare them. This is done with observations in terms of 
a cobasis and bisimulations to compare the resulting graphs. 

3.1 Cobases, Observation Graphs, and Bisimulation 

For general introductions to coalgebra we refer to [4, 18]. We first introduce the 
notion of 'cobasis' [16, 13]. For the sake of simplicity, we restrict to the single 
observation hd. 

Definition 21. A stream cobasis B = (hd, (71, . . . , 7fe}} is a tuple consisting of 
operations 7, : A" ->■ A" (1 < i < k) such that for all a,T g A" it holds that 
a = t whenever 

hd( 7il (...(7i„(a))...)) = hd(7i 1 (...(7i„(T))...)) 

for all n G N and 1 < i\, . . . , i n < k. 

As hd is integral part of every stream cobasis, we suppress hd and write 
(71, . . . ,7fc) as shorthand for (hd, (71, . . . ,7fe)). 

Definition 22. For i G N, k e N >0 define 7r a : A" A w : 

7i"o,fc(z : o-) -> x : 7r fe _i ife ((j) 
ir i+ i, k (x : a) 7r iife (cr) 

For every k > 2 we define two stream cobases: 

A4 = (7r ,fc, . . . , TTk-l,k) Ok = (TTl.fe, • • • , 7Tfc,fc) 

Note that ^^(a) selects an arithmetic subsequence of c; it picks every fc-th 
element beginning from index i: 7r ij fe(cr)(n) = a(kn + i). The Wi t k are generalized 
even and odd functions, in particular we have: tl = even = n ^ and 

odd = 7ri,2- 

Observe that A4 and O fc are cobases, that is, every element of a stream 
can be observed. The main difference between A4 and Ok is that Nk has an 
ambiguity in naming stream entries: hd (cr) = hd (even {a)). On the other hand, 
Ok is an orthogonal basis, names of stream entries are unambiguous. 

We employ the following simple coinduction principle: 
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Definition 23. Let B = (71, ... , 7*,) be a cobasis. A B -bisimulation is a relation 
R C A" x A" s.t. (ct,t) e R implies hd(cr) = hd(r) and (7,(0"), 7;(t)) £ i? for 
1 < i < k. 

Lemma 24. For all a, r e A" i£ ZioZrfs £/ia£ cr = t if and only if there exists a 
B -bisimulation R such that (<t,t) E R. □ 

We now further elaborate the coalgebraic perspective. The following def- 
inition formalizes '^-observation graphs' where B = (71,..., 7^) is a cobasis. 
Every node n will represent the stream [rafl E A" , and if the i-th outgoing edge 
of n points to node m then 7i([n]) = [raj. 

Definition 25. Let B = (hd, (71, . . . , 7fe)) be a stream cobasis, and let F be 
the functor F(X) = A x X k . 

A B-observation graph is an F-coalgebra Q = (S, (o, n)) with a distinguished 
root element r £ S, such that there exists an .F-homomorphism [■] : S — > A" 
from Q to the F-coalgebra (A U ,B) of all streams with respect to B: 



(o,n)| 



A" 
6 



id x If 
A x S k > A x (A") fe 

The observation graph Q is said to define the stream [rj e A". (We note that 
[•J is unique by Lemma 26, below.) 

Let a G A". The canonical B-observation graph of a is defined as the sub- 
coalgebra of the F-coalgebra (A W ,B) generated by cr, that is, the observation 
graph (T, B) with root a where T C A" is the least set containing a that is 
closed under 71, . . . , 7*,. The set ds{cr) of B-derivatives of a is the set of elements 
of the canonical observation graph of a. 

Lemma 26. For every B-observation graph the mapping [•] is unique whenever 
it exists. 

For the cobasis Ok, the existence of [-J is guaranteed: 

Proposition 27. The stream coalgebra (A w , Ok) is final for the functor F '(X) = 
A x X k . As a consequence, we have that every F -coalgebra is an Ok- observation 
graph. 

In contrast, the existence of [•] is not guaranteed for A4- The coalgebra 
(A",A4) is final for a subset of F-coalgebras, called zero- consistent, see fur- 
ther [14]. 

Definition 28. Let B = (hd, (71, . . . ,7*;}} be a stream cobasis. A bisimulation 
between B-observation graphs Q = (S, (o,n)) and Q' = (5", (o',n')) is a rela- 
tion R C S x 5" such that for all (s,s'} € R we have that o(s) = o'(s') and 
(ni(s), n-(s')} e R for all 1 < i < k, where denotes the i-th projection on 
n. Two observation graphs are bisimilar if there is a bisimulation relating their 
roots. 
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For deterministic transition systems, such as observation graphs, bisimilarity 
coincides with trace equivalence. As a consequence, the algorithm of Hopcroft- 
Karp [11] is applicable. 

Proposition 29. Bisimilarity of finite B-observation graphs is decidable (in 
linear time with respect to the sum of the number of vertices). 

Proposition 30. Let B be a stream cobasis. Two B-observation graphs define 
the same stream if and only if they are bisimilar. 

3.2 From Zip-A; Specifications To Observation Graphs 

We construct observation graphs for zip-fc specifications. 

Definition 31. Let X = {Xo, . . . , X„_i} be a set of recursion variables and A 
a finite set of data-constants. Let k G N, and S be a zip-fc specification over 
(A, X). We define the orthogonal term rewrite system TZk(S) to consist of the 
following rules: 

hd(a : a) — > a 
7i"o,fe(a : c) -> a : nk-i,k{<r) 
n i+ i t k{a : a) -> ^(cr) (0 < i < k + 1) 

hd(zip fe (cr , • • • , o-k-i)) -> hd(cr ) 
ir itk (z\p k (a , . . . ,CTfe_i)) -> tTi (0 < i < k) 

and additionally for every equation Xj = t of S the rules 

hd(Xj) -> hd(t) 

Ti,fc(Xj)->7Ti,fc(t) (0<i<fc+l) 

where the X^ are treated as constant symbols. 

Whenever S is clear from the context, then by t\. we denote the unique 
normal form of term t with respect to lZk(S). 

Definition 32. Let S be a productive, flat zip-fc specification with root X . 
The set 5k(S) is the least set containing X that is closed under \t.(ir it k(t)l) for 
every < i < k. 

Definition 33. Let S be a productive, flat zip-fc specification with root Xq. 
The A/fc- observation graph Q(S) is defined as: 

G(S) = (S k (S),(o,n)) o(t) = hd(t)4. 

n(t) = {TT ,k(t)l, ■ ■ ■ ,TTk-l,k(t)l) 

with root Xo. In words: every node t has 
(i) the observation hd(£)| (the label), and 
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(ii) outgoing edges to no,k{t)i, . . . , nk-i,k{t)i (in this order). 



Lemma 34. Let S be a productive, flat zip-k specification with root Xq. There 
exists in £ N such that every term in Sk(S) is of the form do : . . . : di-\ : Xj with 
£ < m, do, ■ ■ ■ ,de-i G A and Xj G X. As a consequence Sk(S) and Q(S) are 
finite. 

Proof Sketch. The equations of S are of the form: 



Let m := max {mi < i < n}. It suffices that the claimed shape is closed 
under As.7r, ; fe(s)4, for < i < k. This follows by a straightforward application of 
Definition 31 together with a precise counting of the 'produced' elements. □ 

We need to ensure that the rewrite system from Definition 31 implements 
(is sound for) the intended semantics; recall that S has a unique solution [-J 5 : 
X — > A w due to productivity: 

Lemma 35. Let S be a productive, flat zip-k specification with root Xq. For 
every t G Sk(S) and < i < k we have that hd(i) — >* hd([f]) and [7Ti,k (i)4-] = 
7T, ; fc([[f]]). Hence, the graph Q{S) is an Afk- observation graph defining [Xo] 5 . 

Proof. The extension of [J-] Q from Definition 6, interpreting the symbols TTi_k 
by the stream function m t k ■ A" — > A u for every < i < k, is a model of 



As an application of Lemmas 20 and 35 we get 

Lemma 36. For every productive zip-k specification with root X we can con- 
struct an A/fe- observation graph defining the stream [X ] 5 . 

We arrive at our first main result: 

Theorem 37. Equivalence of zip-k specifications is decidable. 

Proof. Lemma 18 allows to reduce the equivalence problem for unproductive 
zip-fc specifications to a finite number of equivalence problems for productive 
zip-fc specifications. Propositions 30, 29 and Lemma 36 imply decidability of 
equivalence for productive zip-fc specifications. □ 

Proposition 38. Equivalence of productive, flat zip- specifications is decidable 
in quadratic time. 

Example 39. Consider the zip-2 specification with root N: 



Xj - c jfi : 



■ Cj, mj -i ■ ziPfe(Xj,o, • • .,Xj- ife _i) (0 < j < n) 



n k (s). 



□ 



N =0:zip(l : W, 1 : U) 
V = 0:zip(V,l:U) 



U = l:zip(V,U) 
W = zip(N,V) 
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Its 7V2 -observation graph is depicted on the right above. The dashed lines 
indicate a bisimulation with the observation graph from Fig. 1 here depicted 
on the left. 



3.3 Prom Observation Graphs To Zip-A; Specifications 

Lemma 40. The canonical Ok-observation graph of a stream a e A" is finite 
if and only if a can be defined by a zip-k specification consisting of equations of 
the form: 

Xi = at : zip fe (X ij i,Xj i 2, . . . , Xj^) 

Proof For the translation forth and back, it suffices to observe the corre- 
spondence between an equation Y = a : zip fe (Yi, . . . , Yfe) and its semantics 
hd([Y]) = a, 7n lfc ([Y]) - M ■ ■ • , T fc , fc ([Y]) = I Y *J- □ 

Lemma 41. The canonical A4- observation graph of a stream a € A" is finite 
if and only if a can be defined by a zip-k specification consisting of pairs of 
equations of the form: 

Xi = dj : X^ 

K = z 'Pfc( X /(i,l)> X /(i,2), ■ • • > x /(a-i); X /(i,o)) 

over recursion variables X (J X' where X = {Xo, . . . , X„_i} and A"' = {XJ | Xj G 
A?}, and / : N<„ x N<fe — > N<„ smc/i fftai a/(i,o) = a i / or a ^ * G N< n . 

Proof. If Y = a : Y' and Y' = zip fe (Yi, . . . , Y fe _i, Y£,) then hd([YJ) = a, 
^o,fe([Y]) = a : [Y£], and 7r a ([Y]) = [Yj] (1 < i < fc). Since there also is 
an equation Y = a : Yq, it holds that [Yq]] = tl([Y ]]) and hence 7r fc([Y]) = 
[Vol- □ 



4 Automaticity and Observation Graphs 

After our first main result (Theorem 37) we proceed with connecting zip-fc 
specifications to fc-automatic sequences. 



4.1 Automatic Sequences 

Definition 42 ([2]). A deterministic finite automaton with output (DFAO) is 
a tuple (Q, E, S, qo, A, A) where 
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- Q is a finite set of states, 

- E a finite input alphabet, 

- <5 : Q x E ^ Q a transition function, 
qo G Q the initial state, 

- A an output alphabet, and 

- A : Q — > A an output function. 
We extend S to words over E as follows: 

S(q,e) = q for q G Q 

5(q, wa) = S(S(q, a),w) for q G Q, a G E, w G E* 

and we write £)(w;) as shorthand for 5(qo,w). 

For n, k G N, fc > 2, we use (n)/j to denote the representation of n with 
respect to the base k (without leading zeros). More precisely, for n > we have 
(n)k = n m n m -i ...no where < n m , . . . , n < k, n m > and n = Yn=o n & % \ 
for n = we fix (n)/j = e. 

Definition 43. A /c-DFAO A is a DFAO (Q, E, <5, q , A, A) with input alphabet 
E = N<fe. For q G Q, we define a stream C(A<z) by: g)(n) = X(S(q, (n)k)) 
for every n G N. 

We write as shorthand for qo). Moreover, we say that the automa- 
ton A generates the stream C(^)- 

Definition 44. A stream a : A" is called k-automatic if there exists a fc-DFAO 
that generates a. A stream is called automatic if it is /c-automatic for some 
k > 2. 

The exclusion of leading zeros in the number representation (n)k is not 
crucial for the definition of automatic sequences. Every /c-DFAO can be trans- 
formed into an equivalent fc-DFAO that ignores leading zeros: 

Definition 45. A fc-DFAO (Q, E, 6, qo, A, A) is called invariant under leading 
zeros if for all q G Q: X(q) — A(<5(<?, 0)). 

Lemma 46 ([2, Theorem 5.2.1 with Corollary 4.3.4]). For every k-DFAO A 
there is a k-DFAO A' that is invariant under leading zeros and generates the 
same stream (£(A) = ((A') ). 

Automatic sequences can be characterized in terms of their 'kernels' being 
finite. Kernels of a stream a are sets of arithmetic subsequences of a, defined 
as follows. 

Definition 47. The k-kernel of a stream a G A" is the set of subsequences 
{^ kP (a) \ P eN,KkP}. 

Lemma 48 ([2, Theorem 6.6.2]). A stream er is k-automatic if and only if the 
k-kernel of a is finite. 
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4.2 Observation Graphs and Automatic Sequences 

There is a close correspondence between observation graphs with respect to the 
cobasis M k and fc-DFAOs. For fc-DFAOs A that are invariant under leading 
zeros an edge q — > p labeled i implies that the stream generated by p is the 
""i,fc-P r ojection f the stream generated by q, that is, £(A,p) = ^^(^(^4, q)). 
The following lemma treats the case of general fc-DFAOs. 

Lemma 49. Let A = (Q, E, 6, qo, A, A) be a k-DFAO. Then for every q G Q we 
have: t\(((A,5(q,0))) = t\(TT ,k(C(A ?))) and for all 1 < i < k: 

aA,5(q,i))=ir i , k ({(Aq)) (2) 

Hence, if A is invariant under leading zeros, then property (2) holds for all 
< i < k. 

Proof. Follows immediately from (kn + i)k = {n)ki for all n G N and < i < k 
such that n ^ or i ^ 0. □ 

As a consequence of Lemma 49 we have that fc-DFAOs, that are invariant 
under leading zeros, are A/fc-observation graphs for the streams they define, and 
vice versa. Formally, this is just a simple change of notation 2 : 

Definition 50. Let A = (Q, E, 5, qo, A, A) be a fc-DFAO that is invariant under 
leading zeros. We define the A/fc-observation graph Q{A) = (Q,(o,n)) with 
root qo where for every q e Q: o(q) = X(q), ni(q) = 5(q,i) for i < k, and 
>I=C(.'tr/). 

Let Q = (S,(o,n)) be an A4-observation graph over A with root r G S. 
Then we define a fc-DFAO A{Q) as follows: A(Q) = (Q,N <k ,S,q ,A,X) where 
Q = S, qo = r, and for every s £ S: A(s) = o(s), and 8(s, i) — ni{s) for i < k. 

Proposition 51. For every k-DFAO A that is invariant under leading zeros, 
the TVfc- observation graph Q(A) defines the stream that is generated by A. 

Conversely, we have for every N k - observation graph Q, that the k-DFAO 
A{Q) is invariant under zeros and generates the stream defined by Q. 

Another way to sec the correspondence between automatic sequences and 
their finite, canonical A/fc-observation graphs is as follows. The elements of the 
canonical observation graph of a stream a, that is, the set of {7ro,fc, . . . , 7Tfc_i^}- 
derivatives of a, coincide with the elements of the fc-kernel of a. This is used in 
the proof of the following theorem. 

Proposition 52. For streams a G A" the following properties are equivalent: 
(i) The stream a is k-automatic. 

(ii) The canonical Af k - observation graph of a is finite. 

Note that even this small change of notation can be avoided by introducing fc-DFAOs as 
coalgcbras over the functor F(X) = A X X k as well. 
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Proof. The equivalence of (i) and (ii) is a consequence of Lemma 48 in com- 
bination with the observation that the set of functions {iTi^p P G N, i < k p } 
coincides with the set of functions obtained from arbitrary iterations of func- 
tions 7r 0j fc, • • • , 7Tfc-i,fc (that is, function compositions 71 • . . . • 7„ with neN and 

ll € {7r ,fe, . . . ,TV k -l,k})- □ 

Proposition 52 gives a coalgebraic perspective on automatic sequences. More- 
over, it frequently allows for simpler proofs or disproofs of automaticity than 
existing characterizations. For example, in the following sections we will derive 
observation graphs for streams that are specified by zip-specifications. Then it is 
easier to stepwise iterate the finite set of functions {ttq.Jc, ■ ■ ■ , ^k-i,k} than to rea- 
son about infinitely many subsequences in the kernel {^i.kv(a) \ p £ N, i < k p }. 

Proposition 52 was independently found by Kupke and Rutten, see Theo- 
rem 8 in their recent report [14]. 

We arrive at our second main result: 

Theorem 53. For streams a G A" the following properties are equivalent: 
(i) The stream a is k-automatic. 

(ii) The stream a can be defined by a zip-k specification. 

(in) The canonical Mk-observation graph of a is finite. 

(iv) The canonical Ok -observation graph of a is finite. 

Proof. We have that (i) O (Hi) by Theorem 52, (Hi) => (ii) by Lemma 41, and 
(ii) => (in) by Lemma 36. Moreover, it holds that (iv) => (ii) by Lemma 40. 

Finally, we show (Hi) =>■ (iv). Assume Q — (S, (o, n)) is a finite A4-observa- 
tion graph with root r defining a and let \\g : S — > A u be the unique 
F-homomorphism into (A",A4)- Let n — (n\, . . . , rife). Then o(s) = hd([s]g) 
and [rij(s)]g = 7Tj_i ; fe([s]g) for all 1 < i < k and s G S. We define Q' = 
(S', (o',n')) where S' = S U {ti(s) \ s G S}, o'(s) = o(s), o'(t|(s)) = o(n 2 (s)) 
n-(s) = n i+ i(s) for 1 < i < k, n' k (s) = tl(m(s)), n-(ti(s)) = n i+2 (s) for 
1 < i < k-2, n'^^s)) = tl(ni(s)) and n' k (t\(s)) = tl(n 2 (s)) with root r G 5". 
Let l-jg, : S' -»• A w be defined by [s] e - = fsjg and fti(s)h' = HMo)- lt 
can be checked that [•]§/ is an F-homomorphism into (A", Ok) with a — \r\gi . 
Hence Q' is an Cfe-observation graph defining a. □ 

5 A Dynamic Logic Representation 
of Automatic Sequences 

This section connects automatic sequences with expressivity in a propositional 
dynamic logic (PDL) derived from the cobases A4 and Ok- For simplicity, we 
shall restrict attention to the case of A = {0, 1} and J\f 2 — (hd, even, odd). 
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The set of sentences ip and programs tt of our version of PDL is given by the 
following BNF grammar: 



ip ::= | 1 | -up ip A ip [ir]p 
tt ::= even | odd | tt; tt \ tt U tt \ tt* 

We interpret PDL in an arbitrary f-coalgebra Q — (S, (o, n)). Actually, we can 
be more liberal and interpret PDL in models of the form 

Q = (5,0, 1, even, odd) 

where C S, 1 C S, even C S 2 , and odd C S 2 . These are more general than 
i^-coalgebras because we do not insist that D 1 = 0, or that even and odd 
be interpreted as functions. Nevertheless, these extra properties do hold in the 
intended model 

(A w , 0,1, even, odd) 

where is the set of streams whose head is the number 0, and similarly for 1; 
(cr, t) e even iff r = (<ro, 0-2,0-4, . . .), and similarly for odd. 

The interpretation of each sentence ip is a subset of S; the interpretation of 
each program tt is a relation on S, that is, a subset of S x S. The definition is 
as usual for PDL: 

| = {ieS:i£0} [even] = even 
[1] = {x g S : x e 1} [odd] = odd 

A V] = l<pj n M [7n;7r 2 ] = [7ri];[7r 2 ] 

H>] =S\M [7T1 U 7T 2 ] = [TTl] U [7T 2 ] 

[t*] = H* 
«*]H^(V#,!/)eW^i/eH)} 

In words, we interpret even and odd by themselves that correspond in the given 
model. We interpret ; by relational composition, U by union of relations, * by 
Klccne star (— reflexive-transitive closure) of relations, and we use the usual 
boolean operations and dynamic modality {n]ip. 

We use the standard boolean abbreviations for ip — > and ip ip, and of 
course we use the standard semantics. We also write (n)ip for -i[7t]-k^; again 
this is standard. 

For example, let \ be the sentence [(even U odd)*](0 o -d). Then in any 
model Q, a point x has x \= \ iff f° r au points y reachable from x in zero or 
more steps in the relation even U odd, y satisfies exactly one of or 1. 

Proposition 54. If f : M — > N is a morphism, of models and x \= ip in M, 

then f{x) \= ip in N. 

Proposition 55. For every finite pointed model (Q,x) there is a sentence ip x 
of PDL so that for all (finite or infinite) F-coalgebras (H,y), the following are 
equivalent: 
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(i) V\=<Px in U. 

(ii) There is a bisimulation between Q and % which relates x to y. 
We call tp x the characterizing sentence of x. 

For infinitary modal logic, this result was shown in [4], and the result here 
for PDL is a refinement of it. 

For example, we construct a characterizing sentence for the Thue-Morse 
sequence M, see Fig. 1. Let tp and i[ be given by 

tp = A -d A (even}0 A [even]0 A (odd)l A [odd]l 
tp = -.0 A 1 A (even)l A [even]l A (odd)0 A [odd]0 

Then tp^ = tp A [(even U odd)*](tp V tp) is a characteristic sentence of the top 
node in Fig. 1; tp M also characterizes M in the following sense: the only stream 
a such that a |= tpM is M. 

Proposition 56. The following finite model properties hold: 

(i) If a sentence tp has a model, it has a finite model [12]. 

(ii) If tp has a model in which even and odd are total functions, then it has a 
finite model with these properties [5]. 

Remark 57. Our statement of the second result is a slight variation of what 
appears in [5]. 

We arrive at our third main result: 

Theorem 58. The following are equivalent for a E A": 

(i) a is 2-automatic. 

(ii) There is a sentence tp such that for all r e A w , t \= tp in (A", (hd, even, odd}} 
iff t = o-. 

Proof, (i) (ii): Let a be automatic, and let M be a finite F'-coalgebra and 
x G M be such that the unique coalgebra morphism / : M — > A" has f(x) = a. 
Let tp x be the characterizing sentence of x in M, using Proposition 55. By 
Proposition 54, a |= tp x in A". Now suppose that r |= tp x in A". Since tp x 
is a characterizing sentence, there is a bisimulation on A" relating a to r. By 
Lemma 24, a = r. 

(ii) => (i): Let tp be a sentence with the property that a is the only stream 
which satisfies tp. Since a has a model, it has a finite model, by Proposition 56. 
Moreover, this model M may be taken to be a finite F-coalgebra with a distin- 
guished point x. By [14, Theorem 5] let tp : M — > A" be the unique coalgebra 
morphism. Let r = tp(x). Since M is finite, r is automatic. By Proposition 55, 
t |= tp in A". But by the uniqueness assertion in part (2) of our theorem, we 
must have t = a. Therefore a is automatic. □ 
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6 Mix-Automaticity 



The zip-specifications considered so far were uniform, all zip-operations in a 
zip-fc specification have the same arity fc. Now we admit different arities of zip 
in one zip-specification (Definition 4). To emphasize the difference with zip-fc 
specifications we will here speak of zip-mix specifications. This extension leads 
to a proper extension of automatic sequences and some delicate decidability 
problems. 

Definition 59. A state- dependent- alphabet DFAO is a tuple (Q, E, 6, qo, A, A), 
where 

- Q is a finite set of states, 

- E = {T, q } q£ Q a family of input alphabets, 

S = {5 q : Eg — > Q} q< zQ a family of transition functions, 
qo G Q the initial state, 

- A an output alphabet, and 

A : Q — > A an output function. 

We write S(q,i) for S q (i) iff i G E g , and extend S to words as follows: Let q G Q 
and to = a n -i . . . do where dj G E ri (0 < i < n) with r, G Q defined by: ro = q 
and rj+i = S(r i} ai). Then we let to) = r n . 

A state-depcndcnt-alphabct DFAO can be seen as a DFAO whose transition 
function is a partial map S : Q x (J E — ^ Q such that a) is defined iff a G E g . 

We use this concept to generalize fc-DFAOs where the input format arc 
numbers in base fc by the following two-tiered construction. We define P-DFAOs 
where P is a DFAO determining the base of each digit depending on the digits 
read before. Thus P can be seen as fixing a variadic numeration system. For 
example, for ordinary base fc numbers, we define P to consist of a single state q 
with output fc and edges 0, . . . , fc — 1 looping to itself. 

Definition 60. A base determiner P is a state-dependent-alphabet DFAO of 
the form P = (Q, {N <( g( g )} 9 , 5, qo, N, /?}. The base-P representation of n G N is 
defined by 

(n) P = (n) P;90 where (n) P;9 = (n')p,5( 9 ,d) • d 

with n' = L^jJ an d d = [n]p( q ), the quotient and the remainder of division of 
n by /3(g), respectively. 

A P-DFAO A is a statc-dependent-alphabet DFAO 

A=(Q',{N <W) } ?w ,^, go ,A,A) 

compatible with P, i.e. (Q', {N <| g'( 9 ')} g / e Q', 5', g , N, /?'} and P are bisimilar. 
A mix-DFAO is a P-DFAO for some base determiner P. 
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Note that the output alphabet of a base determiner can be taken to be finite 
as the range of /?. The compatibility of A with P entails that A reads the number 
format defined by P. Moreover, every mix-DFAO A — (Q, {N</3( 9 )} 9 gq, S, q , A, A) 
is aP A -DFAO where P A = (Q, {N <m } qeQ , 8, q , N, P). 

These DFAOs introduce a new class of sequences, which we call 'mix-automatic' 
in order to emphasize the connection with zip-mix specifications. 

Definition 61. Let P be abase determiner, and A = (Q, {^<p(q)} q eQi 8, qo, A, A) 
a P-DFAO. For states q e Q, we define {(A, q) e A" by: ((A, q){n) = \{5{q, (n) P )) 
for all n E N. We define ((A) = ((A, q ), and say A generates the stream £(A). 

A sequence a G A" is P-automatic if there is a P-DFAO A such that a = 
({A). A stream is called mix- automatic if it is P-automatic for some base 
determiner P. 

Example 62. Consider the following mix-DFAO A: 



(°) 1 (*) 




We note that A is a P-DFAO where P is the base determiner obtained from A 
by redefining the output for q 0} qi and qi as the number of their outgoing edges 
2, 3 and 2, respectively. 

As an example, we compute (5)a, and (23) a as follows: 

(5) w = (2) 91 l = (0)^21 = 21 
(23) 90 = (11) 91 1 = (3) 91 21 = (1) 92 021 = (0) 9o 1021 = 1021 

where (n) q denotes (n)A, q - The sequence ((A) begins with 

a : b : b : a : b : b : a : a : b : b : b : a : a : a : a: b : b : b : b : b : b : a : a : a: a : b : a : b : . . . 

with entries 5 and 23 underlined. E.g. X(6(qo, 1021)) = a since starting from q 
and reading 1021 from right to left brings you back at state qo with output a. 

We briefly indicate how to see that mix-automaticity properly extends au- 
tomaticity. Let a and r be k- and ^-automatic sequences. Then the stream 
zip(a, t) is mix-automatic, but not necessarily automatic, by Cobham's Theo- 
rem [6]. 

Proposition 63. The class of mix- automatic sequences extends that of auto- 
matic sequences. 

Definition 64. Let k : A" ->■ N>i, and let G be the functor G{X) = J2k=2 A x 
X k . We define the cobasis 

M K = (hd,ACT.(7T 0!K((T )((T),...,7r K ( o . ) _ 1 ^( (T) ((7))) 

An M K - observation graph is a G-coalgebra Q = (S, (o,n)) with a distinguished 
root element r £ S, such that there exists a G-homomorphism [•]:£—> A" 
from Q to the G-coalgebra (A", Af K ) of all streams with respect to Af K : 
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A" 



S 



(o,n) 



Er= 2 Ax(A") fe 



The observation graph Q defines the stream \r\ e A". A mix-observation graph 
is an A/" K -observation graph for some k. 

The following result is a generalization of Theorem 53. The key idea is 
to adapt Definition 33 by computing the derivatives 7To,fc(t)i, . . . , nk-i,k(t)i of 
a zip-term t where now k is the arity of the first zip-symbol in the tree un- 
folding of t. Moreover, we note that mix-DFAOs yield mix-observation graphs 
by collapsing states that generate the same stream (for each of the equivalence 
classes one representative and its outgoing edges is chosen). This collapse caters 
for mix-DFAOs which employ different bases for states that generate the same 
stream. 

Theorem 65. For streams a e A u the following properties are equivalent: 
(i) The stream a is mix- automatic. 

(ii) The stream a can be defined by a zip-mix specification. 

(in) There exists a finite mix- observation graph defining a. 

Example 66. The zip-mix specification corresponding to the mix-automaton 
from Example 62 is: 



We have seen that equivalence for zip-fc specifications is decidable (Theo- 
rem 37), and it can be shown that comparing zip-fc with zip- mix is decidable 
as well. In the next section we show that equivalence becomes undccidable 
when zip-mix specifications are extended with projections 71"^. But what about 
zip-mix specifications? 

Question 67. Is equivalence decidable for zip-mix specifications? 

7 Stream Equality is n^- complete 

In this section, we show that the decidability results for the equality of zip-fc 
specifications are on the verge of undccidability. To this end we consider an 
extension of the format of zip-specifications with the projections ni.k- 



X 
Xi 

x 2 



b:X[ 
b:X> 2 



a : Xn 



■o 



X = ZiP 2 (Xl,Xg) 

x i = zip 3 (X ,Xi,X2) 
X 2 = zip 2 (X ,X' 1 ) 
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where X G X, a G A, i, k G N. A zip v -specification consists for every X G X of 
an equation X = t where t G Z 7r (A, X). 

The class of zip^-specifications forms a subclass of pure specifications [10], 
and hence their productivity is decidable. In contrast, the equivalence of zip^-specifications 
turns out to be undecidable (even for productive specifications). 

Theorem 69. The problem of deciding the equality of streams defined by pro- 
ductive zip* -specifications is ^{-complete. 

For the proof of the theorem, we devise a reduction from the halting problem 
of Fractran programs (on the input 2) to an equivalence problem of zip^-specifications. 
Fractran [7] is a Turing-complete programming language. As intermediate step 
of the reduction we employ an extension of Fractran programs with output (and 
immediate termination): 

Definition 70. An Fractran program with output consists of: 

- a list of fractions 21, . . . , m. (k,p 1 ,q 1 ,.. .,p k , q k G N> ), 

- a partial step output function A:{l,...,fc}— 1 T 

where T is a finite output alphabet. A Fractran program is a Fractran program 
with output for which A(l)f, ■ • ■ , A(fc)t- 

Let F be a Fractran program with output as above. Then we define the 
partial function ( • ) : N — {1, . . . , k} that for every net) selects the index (n) 
of the first applicable fraction by: 



(n) = min { i I 1 < i < k, n ■ — G N } 



where we fix (min 0)f. We define f F : N -> N U T U {_L} by: 



'n-fg if (n)i and A«n»t 
fF(n) = I A((n» if and A((n))| 
± if(n)| 



for all n G N. The first case is a computation step, the latter two are termination 
with and without output, respectively. 



We define the output function X* F : N — 1 T U {±} of F by 




7 if 7 = /f(") G T U {±} for some i G N 
t if no such i exists 



If \* F (n)l then F is said to halt on n with output X* F (n). Then F is called 
universally halting if F halts on every n G N>o, and F is decreasing if < % 
for every 1 < i < fc with A(i)|- 
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For convenience, we denote Fractran programs with output by lists of anno- 
tated fractions where A(i)t is represented by the empty word (no annotation): 

^A(1),...,^A(*) 
qi qk 

Lemma 71 ([7]). The problem of deciding on the input of a Fractran program 
whether it halts on 2 is Sj- complete. 

We transform Fractran programs F into two decreasing (and therefore uni- 
versally halting) Fractran programs Fo and Fl with output such that F halts 
on input 2 if and only if there exists ntN such that the outputs of Fo and F\ 
differ on n. 

Definition 72. Let F — 21, . . . , 2t be a Fractran program. Let a\ < . . . < a m 
be the primes occurring in the factorizations of pi, . . . ,pk, qi, • ■ • , qk- Let Zi,z 2 ,c 
be primes such that zi,z 2 , c > rio<i<feP* ' an ^ z i > z 2 and z\ > 2 • c. 
We define the Fractran program F° with output as: 

simulate F cleanup 
*s — 



Pi Pk 1 1 



1 1 z 2 2-c 1 

"Xo, -, j ! TXb 



/V.L<< 11 1 7 1 , 

c ■ Z 2 C Zl ■ Zl Zl 1 

F halted initialization F did not ha 1 * 

Let F 1 be obtained from F° by dropping and 

Lemma 73. TTie programs F° , F 1 are decreasing and universally halting, and 
X* Fi (n) e {Xa,Xfc} /or a/I n g N, i £ {0,1}. 

Lemma 74. TTie following statements are equivalent: 

(i) \* F0 (n) = X* F1 (n) for all n G N >0 . 

(«; A^o^ 1 • z 2 e2 ) = Xp^zf 1 ■ zl 2 ) for all e u e 2 e N. 

(mj TTie Fractran program F does not halt on 2. 

Next, we translate Fractran programs to zip^-specifications. 

Definition 75. Let F = |J-A(1), . . . , ^A(fc) be a decreasing Fractran program 
with output. 

Let d := lcm{q u . . . ,q k ), and define p' n = d-p^/q^) and &„ = n-p( n )/q^ n ) 
for 1 < n < d; if (n)t, let p^t an d 6„t- We define the zip* -specification <S(F) 
for 1 < n < by: 

x o = ziPd( x i: • ■ - )X d ) 

X„ = 7r 67i „i !p; (X ) if (n)i and A«n»t 

X„ = A«n» : X„ if <n)| and A«n))| 

X„ = JL : X„ if (n)f 
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Lemma 76. Let F be a decreasing Fractran program with output. The zip™ - 
specification S(F) is productive and it holds that [Xo] 5 ^-* (n) = X* F (n + 1) for 
every neN. 

Proof of Theorem, 69. We reduce the complement of the halting problem of 
Fractran programs on input 2 (which is nj-complete by Lemma 71) to equiva- 
lence of zip^-specifications. 

Let F be a Fractran program. Define F°, F 1 as in Definition 72. By 
Lemma 73 both are decreasing. By Lemma 76 S(F l ) is productive, and we 
have [X l 5 ( F, )(n) = X* Fi (n + 1) for every n e N and i e {0,1}. Finally, by 
Lemma 74 it follows that S(F°) and S^F 1 ) are equivalent if and only if F halts 
on 2. 

The equivalence problem of productive specifications is obviously in since 
every element can be evaluated. □ 

Related work The complexity of deciding the equality of streams defined by 
systems of equations has been considered in [17] and [3]. In [17], Rosu shows 
n^-completeness of the problem for (unrestricted) stream equations. In [3], 
Balestrieri strengthens the result to polymorphic stream equations. However, 
both results depend on the use of ill-defined (non-productive) specifications 
that do not uniquely define a stream. The Ll^-hardness proofs employ stream 
specifications for which productivity coincides with unique solvability. As a 
consequence, both results depend crucially on the notion of equivalence for 
specifications without unique solutions. 

In contrast to [17] and [3], we are concerned with productive specifications, 
that is, every element of which can be evaluated constructively. Then equality 
is obviously in Li" . We show that equality is Ll^-hard even for a restricted class 
of polymorphic, productive stream specifications. 
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A Appendix 



Proof of Lemma 15. Introduce a fresh root Xq and add the equation X = X . 

□ 

Definition 77. A zip-specification S is called zip-guarded if every cycle in S 
contains an occurrence of zip. 

Example 78. Specification (1) of the Thue-Morse stream is zip-guarded. In 
contrast, the zip-specification: 

alt = zip(zeros, ones) zeros = : zeros ones = 1 : ones , 

specifying the stream 0:1:0:1:0... of alternating zeros and ones, is not 
zip- guarded. 

It is an easy exercise to show that zip- free cycles correspond to periodic 
sequences, and periodic sequences can be specified by zip-guarded zip-k specifi- 
cation (for arbitrary k). Hence: 

Lemma 79. Every zip-k specification can be transformed into an equivalent, 
zip-guarded zip-k specification. 

Proof. Every zip-free cycle M = . . . = c\ : . . . : c n : M characterizes a periodic 
sequence a = uuu . . . with u <G A*. Note that for every i, k G N, ir^k (c) is again 
periodic with a period length < |u|. Thus the A4-observation graph is finite (for 
every k) and hence by Lemma 41 we have a zip-guarded specification for a. □ 



Proof of Lemma 20. Let S be a zip-specification. Using Lemma 79 let S be 
zip-guarded. If S contains an equation of the form: 

X = ci : . . . : c m : zip fe (si, . . . , s iy . . . , s k ) (*) 

such that Si £ X for some 1 < i < k, then we pick a fresh X' and replace the 
equation by: 

X = ci : . . . : c m : zip fe (si, . . . , X', . . . , s k ) 

X' = Sl 

Clearly, the resulting specification is equivalent to the original, and still zip- 
guarded. We repeat this transformation step until there are no equations of 
form (*) left, that is, the arguments of every occurrence of zip fc -symbols are 
only recursion variables. 

Next, we replace equations of the form: 

X «;:...: c m : Y (f) 
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with Y a recursion variable, by (unfolding Y): 



X = ci : 



: c. 



rn 



: r 



where the defining equation for Y is Y = r. The obtained specification is 
equivalent and remains zip-guarded. Again, we repeat this step until there no 
longer are equations of form (f). This process is guaranteed to terminate since 
the specification is zip-guarded. 

In the final specification, every right-hand side of an equation contains a zip fe 
(for some k > 2), and this zip fc is applied to recursion variables only. Hence the 
final specification is flat. 

Furthermore, note that the resulting specification contains only zip fe -symbols 
for k > 2 for which a zip fe also occurs in the original specification. As a conse- 
quence, the transformation preserves zip-k specifications. □ 



Proof of Lemma 26. Let Q = (S, (o, n)) be a ^-observation graph, and let |-]2 : 
S — > A w be two F-homomorphisms from Q to S& = (A", 2?). Define the rela- 
tion R C A" x A" by R [t] 2 for all t 6 S. It is easy to check that R is a 
£>-bisimulation, and hence Js]i = ls} 2 for all s <E S by Lemma 24. □ 



Proof of Proposition 27. The final coalgebra for F is the fe-automaton of the 
A- weighted languages (or of the fc-ary trees with labels in A) (A fc , (o, n)) where 
k := {0,1,..., k}, o : A r -> A, L 4 L(e), and n : A k " (A fe *) fc , n{L) ^ 
(L\, . . . , Lk) with Li defined by Li(w) — L(i ■ w) for all w € k* (for the case 
k = 2 sec [14, Thm. 3]). The function / : k* ->• cj defined by m 0, and 
ao<ii...a n h-> yj™ =0 (l + is bijective and induces an isomorphism from 

(A", Ok) to (A fe , (o, n)), because it has the property that f(i ■ w) — kf(w) + i 
holds for alii € fe and w E k* . □ 



Proof of Lemma 34- The equations of S are of the form: 



We define m :— max {rrii < i < n }. 

Note that the root A is of the claimed form. Thus it suffices that TT it k{s)i 
is the shape whenever s 6 S is and < i < k. Let s = d : • • • : de-i : Xj G 5 
with £ < m, and let < i < fc. Then it holds that: 



,-i : zi Pfc( X j,o, • • ■ ,Xj,fe_i) 



(0 < j < n) 




abbreviate as D 



abbreviate as C 
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where a, b, i' and i" are denned by: 

a=[(t-l-i)/k\ i'=[t-l-t\ k 
b = l(m 3 - 1 - i')/k\ i" = [mj - 1 - i\ 

The number of elements in £)[(?[□]] is at most [(£+mj + l)/k\ (since iii.k 'walks' 
over I + m,j elements). Hence -DfCLY.,-^//]] is a normal form of the claimed form 
with a stream prefix of length < [(£ + rrij + l)/fcj < [(2m + l)/fcj < to. □ 

Lemma 80. For every productive, flat zip-specification S with root Xq it holds 
that: 

|fe (Xo) | <2-(|E| + l)-m-n + 4-m, 

where n is the number of recursion variables and m the longest prefix ( as defined 
in the proof of Lemma 34)- 

Proof. We use the notation from the proof of Lemma 34. Here we consider 
only case A/2 = (even, odd). The proof of the general case works analogous. We 
define 

T = {c\ : . . . : Ck :Xi \ k < m, Ci data elements, i < n} 

We have <5s(Xo) C T by the proof of Lemma 34. 

For a set S of terms we define the one step derivatives 5(S) as follows: 

6(S) = {even(s)|, odd(s)| | s e S} 

We consider a term t e T: 

t = ci : . . . : c k : Xj 

with k < m. Then even(f)| and odd(t)| are of the form: 

c h : . . . : c tk , : t' with t' G 5(X t ) (3) 

where k' < \k/2] and Cj x , . . . , Ci k , are residuals of ci, . . . , Ck- 

Then by induction it follows that every term s G <5r io S2 fe l (^ nas th e form: 

s = t' or s = Cj : t' with t' e 5^ fcl (X,) (4) 

with 1 < j < k. Furthermore, for / > k, every s G S^ og2 ^(t) has the form (4) 
with j G {1, . . . , k}. Hence it follows: 

n— 1 / 

s [iog 2 ml (T) C |J ( M log 2 ra l(X,)u|Jc: <5 riog 2 ™1 (X,) 
i=o ^ ces 

and therefore: 

n-l 

| 5 ri°g 2 ml( T )| < (|E| + 1) • I y 5riog 2 ml( X .)| 

i=0 
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Since \S^° &2 m l (Xj)| = 2^2™! W e get: 

|^nog 2 ™l(T)| < + 1) • 2 riog 2 m T • n 
< + 1) -2-m-n 

Let <^*(t) = Uj-<j 5 J '(t)- Then <5 S (X ) C ^f 1 **" ml ( Xq ) y tfP°S2 fcl ( T ) and hence: 

|5!s(Xo)| < |^ riog 2 ml (X )| + |(5 rioS2fcl (T)| 

< (1 + 2 + 4 + ... + 2^2 H ) + (|s| + 1) . 2mn 

< 4 • m + + 1) • 2 • m ■ n n 



Proof of Proposition 38. By Lemmas 34 and 80 the observation graphs of pro- 
ductive and flat zip-specifications are finite and have polynomial size. By Propo- 
sition 29 we can decide bisimilarity of observation graphs in linear time. Con- 
sequently equality of zip-specifications is decidable in polynomial time. □ 



Proof of Proposition 55. First, we have a construction that works for any model 
M. For every point a £ M and every h, we define the formula ip 1 ^ a . The 
definition is by recursion on h (simultaneously for all x £ M) as follows: ip° M a is 
the conjunction of all atomic propositions (0 or 1) satisfied by a and all negations 
of atomic propositions not satisfied by a. Given (^^,6 f° r au & e ^f) we define 

^M + a = A < eVen >^M,h A t eVen ] V VM.b 
(a,b)£even (a,b)eEeven 

A f\ (odd)^ )6 A [odd] \/ y h M>b 

(a,h)Godd (a,6)Godd 
A ¥>M, a 

We always identify sentences up to logical equivalence. 

As M ranges over all models and a over the points of M, we call the sentences 
fM.a tne canonical sentences of height h. 

Lemma 81. The following hold: 

(i) For all h, there are only finitely many sentences tp 1 ^ a . 

(ii) For every h, every world of every model satisfies a unique canonical sen- 
tence of height h. 

(Hi) If R is a bisimulation relation between models M and N, and if a R b, 
then tp h M a = ip% b . 
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(iv) If M and N are finitely branching models, then the converse of part (Hi) 
holds: the largest bisimulation between M and N is the relation R defined 
by 

aRb iff for all h, ip h Ma = tp h Nh (5) 

Proof. For parts (i) and (ii), see Proposition 2.5 and Lemma 2.6 of [15]. (The 
arguments there are for one modality, but the results extend in an obvious way 
to our setting.) Part (hi) is a standard fact, as is the "Hennessy-Milner" result 
in part (iv). □ 

Now we return to Proposition 55. We hx a finite model M and some point 
x in it. Let R be the largest bisimulation on M. It is a general fact that R is 
an equivalence relation, and indeed this also follows from the characterization 
in (5). For all a, b in M, if -i(a R b), then there is some natural number h so 
that tp M a 7^ tp M b . Since M x M is a hnite set, there is some fixed h* so that 
for all a, b £ M, if -i(a R b), then ip M a ^ ip M b . The key consequence of this is 
that for all a,b £ M, 

if </>M,a - <P&,b, thm als0 tfj 1 = vXlf (6) 

From this and Lemma 81, part (ii), it follows that for all a, b e M, 

if « h Vmm thcn also a N Vmj, 1 ( ? ) 
For a £ M, let i/j a denote the formula 

</>M,a ->• f A ( even )^M,h A [even] \/ ^ >6 

(a,fc)6even (a,b)Geven 

A /\ (odd)^K,6 A [odd] \/ ^,6 

(a,fc)Godd (a,6)Godd 

Using (7), we see that for all a, b £ M, a\= ip b . 

We now finish the proof of Proposition 55. We have our model M and a 
point x <GM. We take the characterizing sentence of x in M to be 

ip x = (p MiX A [(even U odd)*] f\ ip a 

aeM 

It is easy to see that x \= (p x - To end our proof, suppose that N is any model, 
and y £ N satisfies ip x . We define a bisimulation R between M and N which 
relates x to y: 

aRb iff a is reachable in M from x using (even U odd)*, 
b is reachable in N from y using (even U odd)*, 

and VM,a = ¥>N,b 

The definition of tp a ensures that R is a bisimulation. 

This completes the proof. □ 
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As an example of the construction in the above proof we obtain the following 
sentence <^m characterizing the Thue-Morse sequence M: Let {n}(p abbreviate 
(tt)lp A [n]<p, and let 

(p = A -.1 A {even}0 A {odd}l 
ip = -.0 A 1 A {evenjl A {odd}0 

Then 

ip M = (p A [(even U odd)*] ^{ip — > {evenjtp A {odd}?/>) 

A (ip — > {even}?/; A {odd}(^) 
A (0 ->■ {even}0) 
A (1 ->■ {even}!) 



Theorem 82 (Cobham [6]). Let k,i > 2 be multiplicatively independent (i.e., 
k a ^ l b , for all a, b > 0), and let a G A" be both k- and (.-automatic. Then a 
is eventually periodic. 

Proof of Proposition 63. Let k, ( be multiplicatively independent integers, and 
let a G A A and t G A b be k- and ^-automatic sequences, generated by DFAOs 
A = (Q A ,N <k ,5 A ,q A fi,A A ,X A ) and B = (Q B ,N <e ,S B , q B . , A B , \ B ) , respec- 
tively. Assume that both a and r are not eventually periodic. 

We show that the sequence v = z\p(a, r) G (A^ U A B )^ is mix-automatic, 
but that there is no integer m > 2 such that v is m-automatic. For the first, note 
that v is generated by the mix-DFAO C — (S, {^<p( 8 )}ses, S, s , Aa U A B , A), 
where S = {s } U Qa U Q b , and with j3, 5 and A defined by 

P(s ) = 2 S( So ,0)=q \(s ) = \ A (q ) 

S(s , 1) = r 

P(q) = k 5(q,a) = 6 A (q,a) X(q) = X A {q) 

[3{r)=£ S(r, b) = 8 B (r, b) A(r) - \ B (r) 

for all q G Qa, t G Qb, a € N<fc and 6 G N<£. Now suppose u is m-automatic, 
for some m > 2. Then, as m-automaticity is closed under arithmetic subse- 
quences, also a and r are m-automatic. By Theorem 82 and the assumption 
that cr, t are not eventually periodic, it follows that k and m are not multi- 
plicatively independent, and likewise for I and m. But then k and t are not 
multiplicatively independent, contradicting our assumption. □ 

Lemma 83. Every decreasing Fractran program with output is universally halt- 
ing. 

Proof of Lemma 83. Then /f(ti) < n or /f(ti) G T U {_L} for every n G N. 
Hence F is universally halting. □ 
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Proof of Lemma 73. By the choice of z\ , z^ and c we have q > p for all annotatcd- 
free fractions 2 from F° and F°. Hence F° and F° are decreasing and by 
Lemma 83 universally halting. 

Since F° and F 1 contain jXb, it follows that the programs always terminate 
with output, that is, either \a or Xb- D 



The following lemmas use the notation from Definition 72: 

Lemma 84. For every n G N we /icroe A^ (2 • c • ) = Xa if and only if the 
Fractran program F halts on input 2 within n steps (that is, 3n' < n. fp 1 (2) = 

j-;. 

Proof of Lemma 84- On inputs of the form 2-c-z^ the program F° behaves as F 
except for removing a prime z 2 in each step. By definition of F° and induction 
we obtain 

fr(2. C .^) = /;(2). C .zr 

for every i < n such that Vj < i. f F (2) ^ _L. 

Assume that there is < n' < n such that // _1 (2) G N and fp' (2) = _L, 
that is, F halts in precisely n' steps. Then in F° to the value fp ~ 1 {2 ■ c ■ z%) = 

fp _1 (2) ■ c- Z2~^ n 1 ' none of the 'simulate F'-fractions is applicable, and after 
the 'cleanup'-fractions have removed all primes occurring in F, the fraction 
Xa will result in termination with output Xa (the fraction is applicable since 
n - \n' - 1) > 0). 

On the other hand assume fp (2) G N for all n' < n, that is, F does not 
terminate within n steps. Then fp (2 ■ c ■ z%) — fp{2) ■ c. As a consequence, 
in F° the 'simulate F'-fractions and likewise -^Xa are not applicable (lacking 
Z2), and thus the 'cleanup'-fractions will remove all primes occurring in F and 
then - removes the c. Finally, only jXb is applicable, resulting in termination 
with output Xb- □ 

Lemma 85. It holds that fF°{n) = /p-i(n) for all n e N such that any of 
ai . . . , a m or c divides n. 

Proof of Lemma 85. For every p G {ai . . . ,a m ,c} there is a fraction i in the 
common prefix of F° and F 1 . □ 



Proof of Lemma 74- The equivalence of (i) and (ii) is a consequence of Lemma 85 
since z\ and zi are the only remaining primes from F° and F 1 not covered by 
this lemma. 

Consider (ii) (iii). Let n G N, we show that F does not halt on 2 within 
n steps. We have fpo{zi ■ z%) = 2 ■ c ■ zV, and fp\{z\ ■ z%) = Xb, and thus 
X* F o(2 ■ c ■ z%) — Xb by (ii). We conclude this case with an appeal to Lemma 84. 
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For (iii) (ii), let ei, e 2 £ N. We have (z^ 1 -zj 2 ) = Xb- Assume e x = 2-n, 
n G N then /^(^i 1 ■ zf) = z" +e2 and hence A^^ 1 • z| 2 ) = (by definition of 
F°). Otherwise ei = 2 • n + 1 and we obtain /"oOi 1 ' z ?) = 2 ' c • z" +e2 . Thcn 
it follows A^ (zi 1 • ^2 2 ) = Xf> by Lemma 84. □ 



Proof of Lemma 76. Let [•] be a solution for S(F). We use the notation from 
Definition 75. We prove 

lX f^(n) = \* F (n + l) 

by induction on n G N. 

Let n G N and m G A r <( j, o G N such that n = m + od. For every 1 < i < k: 
( n +l).a = ( TO + l).a+od.a. By choice of d: G N, hence (n+ 1) • — G 

N^>(m+l)'|eN and thus (n + 1) = (m + 1) and*A((ra + l)) = A((m + i». 

We have [X ](n) = |X m+ i](o). If (n + l)t then (m + l)t and hence 
pT ](n) = [X m+ i](o) = _L. Likewise for (n + 1)1 and A((n + we obtain 

pfo](n) = A«n+l». 

The remaining case is (n + 1)4- and A((n + l))t- Then 

[Xo](n) = ( 7 r 6m+1 _ 1 , pUi ([X ]))( ) 
= [X ](6 ro+ i-l+^ +1 -o) 
= [X ]((m + l)-^±i> -l + d.^±i> . ) 

9<n+l) 9(n+l> 

= [X J(( m + l + d ).^±i> -1) 

9<n+l) 

= [X ](/ F (n+1)-1) 

= X* F (f F (n + 1)) by induction hypothesis 

= A F (n + l) 

Finally, we note that since the derivations from pTo](n) to [XoJ(/f(« + 1) — 
1) also exist on the level of rewrite sequences in the zip-specification, and by 
decreasingness it holds /f(« + 1) — 1 < n, it follows that the specification is 
productive. □ 
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